The multimodal nature of spoken word processing in the visual world: Testing the predictions of alternative models of multimodal integration

نویسندگان

  • Alastair C. Smith
  • Padraic Monaghan
چکیده

Ambiguity in natural language is ubiquitous (Piantadosi, Tily & Gibson, 2012), yet spoken communication is effective due to integration of information carried in the speech signal with information available in the surrounding multimodal landscape. However, current cognitive models of spoken word recognition and comprehension are underspecified with respect to when and how multimodal information interacts in the cognitive system. Within this study we investigate this issue by comparing two computational models both of which frame spoken word recognition and speech comprehension in terms of multimodal constraint satisfaction. Both models permit the integration of concurrent information within linguistic and non-linguistic processing streams, however their architectures differ critically in the level at which multimodal information interacts. We compare the predictions of the Multimodal Integration Model (MIM) of language processing (Smith, Monaghan & Huettig, 2014), which like 'hub and spoke' models of semantic processing (Plaut, 2002; Rogers et al., 2004; Dilkina, McClelland, & Plaut, 2008), implements full interactivity between modalities, to a model in which interaction between modalities is restricted to lexical representations which we represent by an extended multimodal version of the TRACE model of spoken word recognition (McClelland & Elman, 1986). Language mediated visual attention requires visual and linguistic information integration and has thus been used to examine properties of the architecture supporting multimodal processing during spoken language comprehension (Huettig, Rommers & Meyer, 2011). We generate predictions from these alternative models for the influence of visual, semantic and phonological rhyme similarity on language mediated visual attention that are then tested in two visual world experiments. Our results demonstrate that previous visual world data sets involving phonological onset similarity are compatible with both models, whereas our novel experimental data on rhyme similarity is able to distinguish between competing architectures. The fully interactive MIM system correctly predicts a greater influence of visual and semantic information relative to phonological rhyme information on gaze behaviour, while by contrast a system that restricts multimodal interaction to the lexical level overestimates the influence of phonological rhyme, predicting stronger effects of phonological rhyme relative to semantic and visual information, thereby providing an upper limit for when information interacts in multimodal tasks. We discuss the continued under-specification of the representational structures and cognitive architecture supporting multimodal language processing and how novel properties of the deep learning approach offer potential for new insight on these issues that are fundamental to our understanding of language processing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A comprehensive model of spoken word recognition must be multimodal: Evidence from studies of language mediated visual attention

When processing language, the cognitive system has access to information from a range of modalities (e.g. auditory, visual) to support language processing. Language mediated visual attention studies have shown sensitivity of the listener to phonological, visual, and semantic similarity when processing a word. In a computational model of language mediated visual attention, that models spoken wor...

متن کامل

A Critical Visual Analysis of Gender Representation of ELT Materials from a Multimodal Perspective

This content analysis study, employing a multimodal perspective and critical visual analysis, set out to analyze gender representations in Top Notch series, one of the highly used ELT textbooks in Iran. For this purpose, six images were selected from these series and analyzed in terms of ‘representational’, ‘interactive’ and ‘compositional’ modes of meanings. The result indicated that there are...

متن کامل

A Multimodal Discourse Analysis of Some Visual Images in the Political Rally Discourse of 2011 Electioneering Campaigns in Southwestern Nigeria

This paper presented a multimodal discourse analysis of some visual images in the political rally discourse of 2011 electioneering campaigns in Southwestern Nigeria. The data comprised purposively selected political visual artefacts from political rallies across the six Southwestern States in Nigeria (Osun, Oyo, Ondo, Ekiti, Ogun, and Lagos). The data were analyzed using Halliday’s (1985) syste...

متن کامل

A Decision Support System for Urban Journey Planning in Multimodal Public Transit Network

The goal of this paper is to develop a Decision Support System (DSS) as a journey planner in complex and large multimodal urban network called Rahyar. Rahyar attempts to identify the most desirable itinerary among all feasible alternatives. The desirability of an itinerary is measured by a disutility function, which is defined as a weighted sum of some criteria. The weight...

متن کامل

Semantic processing survey of spoken and written words in adolescents with cerebral palsy: Evidence from PALPA word-picture matching test

Objective: The present study aimed to assess and compare semantic processing of spoken and written words in adolescents with cerebral palsy and healthy adolescents. Method: The present study is quantitative in terms of type and experimental in terms of method. Examination Group consisted 30 adolescents with cerebral palsy aged 10 to 15 years were selected by convenience sampling method. All of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016